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Abstract — In the cryptanalysis of stream ciphers and pseudo- 
random sequences, the notions of linear, jump, and 2 adic com- 
plexity arise naturally to measure the (non)randomness of a given 
string. We define an isometry K on F?° that is the precise equiva- 
lent to Euclid's algorithm over the reals to calculate the continued 
fraction expansion of a formal power series. The continued fraction 
expansion allows to deduce the linear and jump complexity pro- 
files of the input sequence. Since K is an isometry, the resulting 
F2°— sequence is i.i.d. for i.i.d. input. Hence the linear and jump 
complexity profiles may be modelled via Bernoulli experiments 
(for F2: coin tossing), and we can apply the very precise bounds as 
collected by Revesz, among others the Law of the Iterated Loga- 
rithm. 

The second topic is the 2 adic span and complexity, as defined 
by Goresky and Klapper. We derive again an isometry, this time 
on the dyadic integers Z2 which induces an isometry A on F2 00 . 
The corresponding jump complexity behaves on average exactly 
like coin tossing. 

Index terms — Formal power series, isometry, linear complexity, 
jump complexity, 2 adic complexity, 2 adic span, law of the iter- 
ated logarithm, Levy classes, stream ciphers, pseudorandom se- 
quences 
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I. Introduction 



For some prime power q, let ¥ q be the finite field with q elements |17j . We 
consider the set F£° of infinite sequences over ¥ q as our starting point. To 
assess the randomness of such F^°-sequences one computes their linear and 
jump complexity profiles. These profiles as well as the 2-adic complexity 
should behave well in the sense that no large jumps occur and on average, 
for linear or 2-adic complexity, resp., q — 1 out of every 2q resp. q symbols 
should lead to a jump. 

We shall occasionally employ the set of finite words over ¥ q which we 
denote as F* (we do not use the multiplicative group of ¥ q , so no confusion 
should arise). In particular, the empty word e is in F*. 

Dealing with linear and jump complexity, a sequence a = (a±, 02, . . .) G F^° 
is considered as coefficient sequence of its generating function, the formal 
power series 

G: F~ - 8 q 

((h) ^ Y%Li a>i x ~ l 

Here S 9 = {/ G F g [[a; _1 ]]:/ = Y^S=i a i x ~ % -> a i e ^<?l * s a subring (without 
unity) of the ring F g [[x -1 ]] of formal power series. The / G § q have negative 
degree. 

Furthermore, we use the polynomial ring ¥ q [x] and its field of fractions 
¥ q (x). Then, S q n ¥ q (x) denotes the image of the ultimately periodic se- 
quences in F^° under G. 

We define the leading coefficient lc(f) of a formal power series as 

00 

/c(0) = and lcC^a k x' k ) = a, G ¥ q \{0}, 

k=i 

and the degree as 

00 

1 1 = —00 and I ^^afcX~ fc | = — i (with aj 7^ 0). 

k=i 

The analogue for polynomials is the usual degree | Ylk=o a k xk \ := ^> where 
the leading coefficient lc(Y2k=o a kX k ) '■= a-d again is assumed nonzero. The 
degree fulfills the ultrametric inequality |/± <?| < max{|/|, |^|} and equality 
I/I 7^ \d\ =^ 1/ =t 5l = max {|/|) l<?|} ( we shall not use the associated norm). 
A function /: F^° — * F^° is then called an isometry, if it preserves distance 
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that is for all a, b G F^°: \a — b\ = \f(a) — f(b)\ (coordinate-wise subtraction 
in F g ). 

The outline of the paper is as follows: 

The next section introduces an isometry K on F^°, where K(a) describes 
the partial denominators of the continued fraction expansion of G(a). As 
isometry, K is information preserving and we shall see, how each symbol of 
K(a) describes precisely what can be said at that moment about the partial 
denominators. 

Section III makes the connection to linear and jump complexity and gives 
the wellknown Euclid-Lagrange-Berlekamp-Massey-Dornstetter algorithm 
in a form that delivers precisely K(a) as discrepancy sequence. 

The section IV is a compilation of consequences of K being an isometry. 
Some of these results are already known but the proofs using K are shorter. 

The main point of the paper is stated in section V. Since K is an isometry, 
an i.i.d. sample a 6 F^° leads to an i.i.d. resulting K(a), and thus on average, 
K(a) can be described by a Bernoulli experiment (in the binary case q = 2 
just coin tossing) and we may apply the sharp known bounds (Levy classes) 
for coin tossing to linear and jump complexity. 

Section VI changes the focus to the 2-adic complexity as defined by Klap- 
per and Goresky. Again, there is an isometry on F^° associated to 2-adic 
complexity and the induced jump complexity turns out to behave on average 
just like the "linear" jump complexity. 

We state an open problem and finish with the conclusion. 

II. The Isometry K 

In this section we will obtain, for a sequence a G F£°, its generating function, 
the expansion of the latter into a continued fraction, and finally an encod- 
ing of the partial denominators of the continued fraction. The whole process 
will turn out to define an isometry on F^° , which we call K ( "Kettenbruch" ) . 

A . Continued Fractions of Formal Power Series 

We start with the continued fraction expansion of a formal power series 
G(a) G S\{0}. For £ G F^aT 1 ]] (not only in S), £ = ££L n a fc ar* we 
define the integral part as [£J := Y^l=n a k x ~ k ■ L?J i s zero for n > 1 and a 
polynomial of degree — n for n < 0. 

Given £o := G(a) G 8\{0}, we now proceed iteratively to obtain Ai(x) = 
ld\ G ^q[x] and & + i := (& - A i (x)y 1 G S. We discard A (x) which is 



3 



always zero, and obtain a sequence (Ai(x), ^(xc), . . .) of polynomials with 
positive degree which are called the partial denominators of the continued 
fraction expansion. This sequence is finite, if G(a) € ¥ q (x) (that is, if a is 
ultimately periodic), it is infinite otherwise. More on continued fractions can 
be found in Perron [SUj over the reals, and in de Mathan |2U1 ch. IV] or Artin 
^ §12 f.] for formal power series. 
We can now write the formal power series G(a) as 

„, N <^ -i 1 1 I 1 I 1 I 

G(a) = > djX = — — = =: — - . + . , . r I . ~. , . +... 

U Mx) + A , (x) + i M*) Ms) Mx) 

More formally, we state this map from §\{0} to sequences of polynomials 
with positive degree in (F g [x]\F 9 )* U (F^x^F^) 00 as operator /C: 

K: (§,nF,(x))\{0} | - (F 3 [x]\Fq)* 

Z)i=l a i x % = ]aTR + \M(x) + • • • + jA^xj ( A i( x ))i=l 

K: S\F 5 (x) -> (F,[ar]\F 9 )°° 

E£i o^ - * = -pTR + ]Mx) + l^sj + • • • ^ (4i(a0)~i 

We further define (0°°)^ ^ e e (F,[x]\F g )*, the empty sequence of 
(no) polynomials. 

The next step gets us back to F^°. We encode the polynomials with positive 
degree by words from F* where a polynomial of degree d > 1 is encoded 
in 2d symbols from ¥ q , the first d symbols determining the degree and the 
second half determining the coefficients. Let 

Ilq := {(a x , . . . , an) € F* | 3d € N : n = 2d, a x = . . . = a d _i = 0, a d + 0} 

be the set of all allowed encodings of polynomials. Then we define the en- 
coding function n and the subdivision into degree and coefficient part as: 

7r:F ff [or]\F, -» U q C F*, Eto«^ ^ O^ 1 a d a d ^ . . . ai a £ ¥ 2 q d 
ttj,:F,[x]\F, F*, Elo^ d - l a d eF d q 

Tr c :¥ q [x]\¥ q F*, Eto°^ " a d ^ 1 ...a 1 a e¥ d 

(a d + 0) 

Hence ir induces a function ir°° on the set (F g [x]\F 9 )* U (Fq[x]\F g )°° of 
finite or infinite sequences of polynomials with positive degree as 
n°°:(¥ q [x]\¥ q y -+ F~ » n(A 1 )\ . . . \ir(A k )\QP° 

^:(¥ q [x]\¥ q r - F-, (A^ =1 -> 7r(^)K(A 2 )|... 

where | indicates concatenation of elements from F*. 
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In a similar way, 7r£° and tt'q are built up from the ttd and ire- 
The set U' g := Tl q U {0 00 } is a complete prefix code for F°°, i.e. every se- 
quence a £ F°° can be decomposed in exactly one way into elements from 
U' q and hence tt~°°:¥^ -► (¥ q [x]\¥ q )* U (F,[x]\F,)°° is bijective. 

B. The Continued Fraction Operator K. F°° — »■ F°° 

We thus map the continued fraction expansion of a generating function 
back into the space F°° and define the Continued Fraction Operator K on 
F°° as 

K: F~->F~, K = 7r°°o/CoG. 

The subdivision of the encodings it and 7r°° into the parts ttd^c an d 
7r § > ! 7r (7 > ' res P-) define the degree and the coefficient part of K as 

K D = tt£ o K o G, 

K c = tic % G. 
We define K for finite (prefix) words by 

Va G F* : K(a) := K(a|0°°)li 

(we will see in Theorem 5 that the continuation 0°° after a is irrelevant) and 
we obtain the inverse operator as 

K 1 = G^ 1 o KT X o 7r~°°. 

(Kd and Kc are not injective and thus Kd _1 , Kc -1 do not make sense) 

We give two examples for K, K -1 : 

(i) Let a = (a;)^ = 1(110)°° € F°°, then 

G(a)(x) = + x' 2 + x~ 3 + x' 5 + x' 6 + x" 8 + x' 9 + . . . 

1 x~ 2 + X~ 3 x 2 + 1 X + 1 I 



X ' 1 + X 3 X 3 + X 2 +X X + l X + 1 + -2+t ' 

fromx 3 + x 2 +x = (x 2 + l)(x+l) + l. Thus /C(G(a)) = (x+l,x 2 + l) G F 2 [x] 2 
and K(o) = vr 00 o /C o G{a) = 1101010°° G F°°, where 11 = vr(x + 1), 
0101 = vr(x 2 + 1). 

(ii) Let a = 11011001001011 . . . € ¥f. By repeatedly taking the integral 
part and inverting, we obtain 



1 ! 1| 1 1 

G(a) (x) = f — I + ^ + 1 + — — 1 + 

X + l X X d + X + 1 x + l 
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The encodings of the partial denominators are tt(x + 1) = 11, ir(x) = 
10, ir(x 3 + x + 1) = 001011, and again tt(x + 1) = 11, hence K(a) = 
111000101111... 

Let Ai be the partial denominators of G(a). Then we iteratively obtain 
convergents jf- to G(a) using Pj := Ai-P^i+Pi^ and Qi := Ai-Qi-i+Qi_ 2 



Q-2 



1. 



□ 



with the initial conditions P-2 = Q-i = 0, P_i 

The recursion for the Pj , Qi leads us to 

Theorem 1 P n ■ Q n ^ - P n _ x ■ Q n = (-l)"" 1 for n > -1 

Proof. By induction on n. See also 30 , §6]. 

The next theorem gives a bound for the precision of the approximation of 
G(a) by f . 

Theorem 2 Let ^ be a convergent to G E § with G / ^. Then 

^k tyk 

(i) |G - §| = -\Q k \ - \Q k +i\ < -2 • \Q k \ 

(ii) For fceN and all Z,N £ ¥ q [x] with < \N\ < \Qk+i\ we have: 



-\Qk\ — \Qk-\ 









G 2 


> 


°~ N 




Qk 



Proof. (compare [2U1 p. 69-74]) 
(i) Let Ai as before. By induction on i we obtain the equation 



and hence 



G P * 



Vi G N n : G 



& • + Pi 



i-2 



Ci ' Qi-1 + Qi-2 



i+l ' 



P + P 



i-1 



P 



£i+l " (PiQi — PiQi) + {Pi-lQi ~ PiQi-1, 



ii+l ■ Qi 2 + QiQi-1 

-l) l |-2|Q,|-|& +1 | 

"IQi+ll _ \Qi\i 



where we have used Theorem 1 and = |^4j+i| = |Q«+i| — \Qi\- 

(ii) See [23 p.221, Th. B.l]. □ 
The approximations 5^ are called convergents. Furthermore, there may 

occur intermediate results |20l p. 71], which we shall call subconvergents: Let 
=d a i ^ ' xJ ^ e a partial denominator, that is Pi — Ai ■ Pj_i + Pj_2 



6 



and Qi = Ai-Qi_i + Qi_2, resp., a (main) numerator and denominator, resp. 
Then for each k = \Ai\,\Ai\ — 1, . . . , 2, 1 we can define an auxiliary partial 
denominator A\ (x) := Yl^k a i ' x ^ that in turn defines a subconvergent 



i-2 



In the preceeding example, A2(x) = x l + x + I and = ^ij^r- Prom 

A2 {x) = x 2 and A 2 (x) = x 2 + x we obtain the subconvergents 



P 2 (2) (x) _ x 2 • 1 + _ x 2 ^ d 

q( 2 \x) x 2 ■ (x 2 + x + l) + 1 ~ x 4 + x 3 + x 2 + I an 

P 2 (1) (x) (x 2 + x)-l + x 2 + x 



qW/j.) (x 2 + x) ■ (x 2 + X + 1) + 1 X 4 + X + I' 

We will see in the proof of Theorem 5 that the subconvergents have 

a precision between that of the convergents n ' -1 and 5-. All convergents 
including the subconvergents will be obtained during the calculation via 
Euclid's algorithm. 
Of course, Theorem 1 is also valid for subconvergents 

if) • g,_! - p,_! • q? ) = (-ir\ 

(again by induction, on k, for fixed i) see also Carter jSJ Lemma 4.2.1]. 

C. K is an Isometry 

In the sequel, we will see that the n-th symbol K(a) n depends exactly on 
a%, . . . ,a n , but not on a n +i, .... In particular, this means that at the end of 
K(a) for a € F* we might have an incomplete encoding. This then describes 
exactly what is known at that position about the partial denominator. 

In the next theorem, we show that K _1 is an isometry on F?° that is, two 
inputs differing for the first time in position n lead to outputs also differing 
here, but not before. 

Proposition 3 "The Ultrametric Square" Let P,Q,R,S £ F^° be four 
"points" with \P -Q\,\R- S\ <\P - R\. Then \Q-S\ = \P-R\. 

Proof. By assumption and the ultrametric inequality |P| 7^ \Q\ =^ \P ± 
Q\ = max{|P|,|Q|}, we have \P - S\ = |(P - R) + (R - S)\ = max{|P - 
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R\,\R-S\} 
\P-R\. 



\P-R\. By the same reasoning \Q-S\ = \{Q-P) + (P-S))\ 



Lemma 4 Let c G with K 



a, n G N, and d G F^° u>it/i d; 



/or i / n, d n G F 9 \{c n } ; and 6 = K" 1 ^) G F^°. T/ien |a 
Proof. Let the continued fraction of G(a) be G(a) : 



b\ 



-n. 



7r(Ai)|7r(A 2 )|7r(^ 3 )| • • • Similarly, let G(b) 



\A 3 (x) 
1 I 



+ 
+ 



Then we have c 



□ 

Ci 

+ 



+ 



+ ... and thus d = 7r(Bi)|7r(B 2 )|7r(5 3 )| .... Now let 
= Bi for i < k and Aj. ^ B k . This implies |Qfc_i| + 



|Bi(aO ^ |B a (a;) ^ \B 3 (x) 

k G N be such that 
IQfel < n < 2\Q k \ (c n is lc(A k ) or part of 7r c (^ fc )). W.l.o.g. let \A k \ < \B k \. 
We have \G(a)-^\ = -|Q fc |-|Q*+il < -2\Q k \ < -n by Theorem 2 (i). We 
consider the ultrametric square made up of G(a), G(b) and their convergents 
q^iTT- First we treat the case \A k \ = \B k \. Then B k = A k + {d n — c n )x d 
with g = 2\Q k \ — n (g more symbols after c n until the end of Tr(A k )). 



Then 



G(b) 



and 

9- 



Pk+x g P k - 



> k +xsQ k _ 1 

2\Q k \ = -n. 



'Qk 



Qk+x g Qk-i 
PkQk+^Pk-iQk-PkQk-x 9 PkQk-i 



■\Qk\-\Qk+i\ < ~2\Q k \ < -n 



Qk(Qk+x a Qk- 



Altogether we have an ultrametric square where \G(d) — 



Qk(Qk+x3Q k _ 1 
•<-n,\G(b)- 



Qk 



I ST ~ 7r\ = ~ n > an< ^ w ith Proposition 3 now follows 



' Qk 

\G(a) 



G(b)\ 



b\ 



-n 



\c-d\ = |K(o) -K(6)|. 



Let now \A k | < | B k \ . In this 

Pk-l I . 



lc(A k ), d n 



Then 



G(a)- 



Qk- 



|Qfc-i| - \Qk\ = ~n and G(b) 



and n = 

Pk-i 



Qk- 



\Qk-i\+\Qk\ 
= — |Qfc-i| - 



□ 



\Qk\ < —n and thus as before G(a) — G(b)\ = —n. 
In the sequel we shall say that we are in case A, B, C, resp., if c n is 

A) a zero of some ttd (in part (ii) of the above example for n = 5,6), 

B) the last coefficient (for n = 2,4, 10, 12), 

C) the leading or some other coefficient except the last (for n = 1,3, 7. ..9, 11). 
Theorem 5 Let neN and c, d G F2° with di = Ci for i < n, c n ^ d n . 

Let a = K-^c) G F°° and b = K^d) G Th en 



\a-b\ = |K(a)-K(6)| = -n. 



Proof. We shall change from (cj) to (dj) one symbol at a time, using 
Lemma 4. Let df ] = \ dil l ~ k Then = c and lim^^ d« = d. Let 
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b (k) = k- 1 ^)). Then \a-b^\ = -oo, since a = &(°). Furthermore, \b^ - 



Hence \a — b^\ = —n (first difference between c and d), and |&( n+fc ) — 
&(n+fc+l)| < _ n _ k _ x < _ n for k e Nq _ 

By the ultrametric equality, we thus have | a — b^ n+k ^ \ = \ (a — b^ ) + (b^ — 
&( n+1 )) + . . . + (U n+k ~ 1 ^ - U n+k ^\ = -n for all k G N, that is after the first 
impact of changing c n into d n , the further changes are "absorbed" by the 
ultrametric. In the limit k — > oo, we obtain \a — b\ = —n = |K(a) — K(b)|. □ 

In summary, the first n symbols of a determine the first n symbols of K(a) 
and vice versa. At each step, the resulting symbols from ¥ q encode all infor- 
mation available at that point about the continued fraction expansion: In 
case B, we just have the complete last determined partial denominator, in 

case A, the next partial denominator will have degree at least n — 2|<5fc i| 

and that is all we can say at the moment, and in case C, the missing coeffi- 
cients of the current partial denominator are as yet unknown and all possible 
values will be assumed for a suitable continuation of a. 

We have thus obtained the fundamental result of the paper: 

Theorem 6 

The function K: — ► is an isometry on F^°, for every finite field ¥ q . 

Remark We call the domain of K (range of K _1 ) coefficient space and 
the range of K (domain of K" 1 ) discrepancy space (as in the Berlekamp- 
Massey-Algorithm) . 



We recall the notion of linear complexity L, give the connection between 
K and L, and adapt the Berlekamp-Massey algorithm to produce precisely 
b = K(a) as discrepancy sequence on input a. Furthermore, we describe an 
alternative method, due to Niederreiter and the author, to calculate K(a) 
by means of the shift commutator [K _1 ,cr]. 

A. Linear Complexity 

We define the linear complexity profile of a sequence a G as L a : No — > 
No with L a (0) = and for n > 1 let L a {n) = J2i=i where A^ is the last 




from ^ = d^ and by Lemma 4, respectively. 



III. Linear Complexity, Euclid's Algorithm 
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partial denominator whose leading coefficient is encoded in K(a)i i = i.„ n . 

The sequence (L a (n)) n >o is called the linear complexity profile of a. (L a (n)) 
is monotonously increasing and jumps, L a (i) > L a (i — 1), where K(a)j en- 
codes a leading coefficient. We next show that the usual definition in linear 
feedback shift register (LFSR) theory is equivalent: 

Theorem 7 L a (n) denotes the length of a shortest LFSR, that produces 
a 1 . . . a n . 

Proof. Given the formal power series G{a) with the convergent numer- 
ators and denominators (Pk) and (Qk), resp., we consider the LFSR with 
normalized feedback polynomial (/c(Qfc)) -1 Qk of length \Qk\ = J2i=i IA| = 
L a (n). This LFSR produces, for a suitable initial content, a sequence b with 
G(b) = From \G{a) — ^\ = —\Qk+i\ — \Qk\ < ~ n now follows that this 
LFSR will produce a (at least) up to a n . On the other hand, 

" 77^" = ~\Qk\ ~ \Qk-x\ = -2|Q fc -i| - \A k \ > -n, 



Q 



k-l 



since the leading coefficient of lies in K(a), ) j = i... n . Hence the previous 
convergent g^~^ is not sufficiently precise and by Theorem 2(ii) also no in- 
termediate LFSR length will do. □ 



We see that typically 2 • L a (n) w n, since with L a (n) = J2i=i 1^*1 we n eed 
just 2 • L a {n) symbols to encode A\, . . . , A^, hence the deviation from the 
sequence length n consists only in the length of the last incomplete encod- 
ing of a partial denominator (missing coefficients in case C of Lemma 4, 
exceeding zeroes in case A). Hence we define: 

Given a sequence a with linear complexity profile {L a {n)) we define the 
linear complexity deviation of a at n as 

Tn a (n) := 2 • L a (n) - neZ, n > 0. 

Remark Comparing with the three cases of Lemma 4, we have: 
m a (n) < in case A, m a (n) = in case B, m a {n) > in case C. 

B. Euclid's Algorithm 

History Euclid ("Elements", 300 BC) invents his algorithm to calculate 
the gcd of two natural numbers. Lagrange (1770 AD) has our Theorems 1 
and 2, all this over the reals. 

Berlekamp [2] describes in 1967 a decoding method for BCH-codes over 
arbitrary finite fields. In 1969, Massey JH] uses this method to obtain a 
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shortest LFSR producing a given sequence. In 1979 Welch and Scholtz |37| 
make the connection with continued fractions and they show that the main 
convergents appear as polynomials in the BMA. 

In 1987 Dornstetter [Hj gives the precise equivalence between the algo- 
rithms of Berlekamp and Euclid, already mentioning the subconvergents. 

During 1988 - 1991 Niederreiter [21] [22] [23] [2H] as well as Dai and Zeng jS] 
then show the detailed connection between linear complexity (L a ) and the 
continued fraction expansion of G(a): A jump by k in the profile corresponds 
to a partial denominator of degree k. Furthermore, in 5 a connection be- 
tween subconvergents and discrepancies is given for the case F2. 

Before giving the implementation in a form that exactly delivers b = K(a) 
as discrepancy sequence, we shall recompile the changes necessary in com- 
parison with |19j : 

(i) The main convergents have to start with P_2 = 0, P_i = 1, Q-2 = 1, 
Q—l = as is the case over R [20] • From Aq = we also have Pq = 0, 

Qo = h 

(ii) we use the feedback polynomial, not its reciprocal, the "connection poly- 
nomial", and 

(Hi) the feedback polynomial will not be normalized. 

In this way we will obtain all partial denominators and all convergents and 
subconvergents and the discrepancy sequence is the encoding K(a) of the 
partial denominators. 

Euclid-Lagrange-Berlekamp-Massey-Dornstetter- Algorithm 
START 

Input a(l,...) // elements of ¥ q 
P = 0, AP =1// numerator 

Q = 1, AQ = 0, // here Q=Qo, AQ = Q_i as initial values 

d = 0, m = 0, j = 0, r = — 1 

/ /loop invariant : m = 2 • d — j, d = \Q\ 

DOj = l,... 

b ti) = EiLo Qtt) ■ a ti + i-d) //equals £ Q(d - i) ■ a(j - i) 
CASE (b(j),m) IS 

(0, • ) : 

m = m — 1 
(/0,>0) : 

m = m — 1 

6 = r • 
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P = P + b ■ x m ■ AP 1 1 numerator 
Q = Q + b ■ x m ■ AQ 
(^0,<0): 

m = — (m — 1) 

b = r 

r = b(j) ^ 

b = b • r 

Ptmp = AP, AP = P, P = P tmp II numerator 
P = P + b ■ x m ■ AP 1 1 numerator 

Qtmp — AQ 7 AQ — Q, Q — Qtmp 

Q = Q + b ■ x m ■ AQ 
d = d + m 

END CASE 

/ /loop invariant : m = 2 • d — j, d = \Q\ 
Output b(j), P, Q 
END DO 

P/Q is the actual (sub)convergent, AP/AQ the previous convergent, (b(j)) = 
K(a(j)). The lines with comment "//numerator" are not necessary to ob- 
tain (bj). For F2 we have b = b(j), b(j) ^ b(j) = 1, and r = r = 1, 
which simplifies the program. 

C. The Shift Commutator of K 
Another method to actually compute K uses the shift commutator 

[K-\a] =Ko (J - 1 or 1 o(T. 

Let some w = K(«) be given, e.g. O 00 = K(0°°), then K(av) = [K _1 , o](aw) 
for a G F q , since 

aw —* w v av K.(av) = [K _1 , a}(aw) 

[K _1 ,cr] can be computed by a transducer with finite state space and an 
up-down- counter in amortized linear time (for a detailed description see [27] 
for the case F2 and |28j for general finite fields). Hence we can compute K in 
amortized quadratic time by repeated application of [K _1 ,cj] via the above 
formula. We thereby obtain with no additional cost all continued fractions 
of all shifted sequences (oj, . . . , a n ), (02, . . . , a n ) . . . (a n _i, a n ) as well. 
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IV. Consequences for Linear and Jump Complexity of K being Isometry 



We derive the partition of K into Kd and Kc from the behaviour of the 
algorithm in III.B. Furthermore, we derive from Kd the linear and jump 
complexity profiles and some combinatorial results. 

A. Translation Theorem 

Theorem 8 For every length n G N and every sequence prefix a € Fq we 
have: 

(i) m a (n)>0: rn a \ an+1 (n + 1) = m a {n) - 1. 
(ii) m a (n) < : 

3ia G F q : m a \ a (n + I) = m a (n) - I, 

V a' 7^ a : m a\a'( n + 1) = 1 — m a (n). 
(Hi) L a (n) > n/2 : L a \ an+1 (n + 1) = L a (n). 
(iv) L a (n) < n/2 : 

3ia G F q : L a \ a (n + 1) = L a (n), 

V a' ^ a : L a \ a i(n + 1) = n + 1 — L a (n). 
(In F2 obviously Vc/ ^a<^a' = a + l) 

Proof. (i) Since m a (n) > 0, when running the algorithm from III.B, 
only the cases (6, m) = (0, •) or (b, m) = (7^ 0, > 0) may appear. Hence 
m(n + 1) = m(n) — 1. 

(ii) For exactly one choice a G F q as a ra+ i we have 6 n _|_i = 0, hence the 
case (6, m) = (0,-) and m(n + 1) = m(n) — 1. All other a' ^ a lead to 
frn+i 7^ and (6, m) = (7^ 0, < 0), thus m(n + 1) = 1 — m(n). 

(iii) ,(iv) are equivalent to (i), (ii) by the definition of m a , see also [331 
p. 34]. □ 

For the following theorem and again in Section V, we need a measure on 
F^°. We define the measure fi on F q as equidistribution, n(a) = ~, V a G F g . 
Taking the infinite product Haar measure of fj,, we obtain the measure 
on F£°. A set A = {a G F^° | a* = bi, 1 < i < k} for fixed h G F q , is called a 
cylinder set and it has measure ^°°(^4) = The set of ultimately periodic 
(rational) sequences in F^° is countable and thus has measure zero. 

Theorem 9 Translation Theorem 
Let a = (cti, . . . ,a/~) and (3 = (J3i, . . . ,/3i) be two sequences from F* with 
maik) = mpQ). Furthermore, let A = a|F^° and B = /?|F2° be the cylinder 
sets of the sequences in F^° starting with a or (3, resp. Then we have for all 
t G Nq and for all d G Z: 



13 



(i) | {a eFj +i | a,i =a.i, i < k, m a (k + t) = d} \ 
= | {b € Fj+* \h = pi, i< I, m b (l + t) =d}\ 

fi°°({a € A | m a (k + t) = d}) _ fi°°({b 6 B \ m b (l + t) = d}) 

In other words: The distribution of deviations m after a given prefix de- 
pends only on the m at the end of that prefix; it does not depend on the 
length or the particular symbols of this prefix. 

Proof. 

(i) By induction on t: For t = both sets contain exactly one element 
for d = m a (k) (which is a resp. (3 itself) and for d ^ m a {k) both sets are 
empty. The step from t to t + 1 follows from Theorem 8(iii,iv). 

(ii) This is just a measure theoretic reformulation of the result in (i). □ 
Corollary 10 Let m s (2k) = for a sequence s of length 2k (thus L s (2k) = 

k). Then the distribution of linear complexity deviations on the cylinder set 
s|F£° equals that on F^°. 
Example Extending s = e (empty sequence) and s = 10, resp., by 3 bits: 



Sl 


S2 


S3 


m(3) 


si 


S2 


S3 


S4 


S5 


m(5) 











-3 


1 














-3 








1 


3 


1 











1 


3 





1 





1 


1 








1 





1 





1 


1 


1 


1 








1 


1 


1 


1 








-1 


1 





1 








1 


1 





1 


1 


1 





1 





1 


-1 


1 


1 





1 


1 





1 


1 





-1 


1 


1 


1 


-1 


1 





1 


1 


1 


1 



B. Partition of K into Kd and Kc 

We need a slight modification of m to be able to relate the positions in K 
to those in Kd,Kc- Let 

m'(n) = m(n — 1) — 1, n > and m'(0) = 

(m! is equal to m, except for the positions of the leading coefficients in K, 
since here m(n) = d, but m'(n) = — d, with d the degree of the current 
partial denominator). 

Theorem 11 Using m' we have the following connection between the linear 
complexity deviation and the distribution of K onto Kd and Kc •' 
m'{n) < : K(a) n = K D (a) (n _ m / (n))/2 
m'{n) > : K(a) n = K c (a) (n _ m / (n))/2 
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Proof. By induction on the codings ir: Let some it end at the (even) 
position 2k. Then Kd and Kp both contain k of the 2k symbols K(a)j i=i...2k 
up to that position and K(a)2fc = Kc(a)fc. Also, before the first coding we 
have k = and Kd = Kc = e empty up to now. 

The next d symbols (d the degree of the next possibly incomplete next 
partial denominator or the length of the terminating zero run) lie in 7Td and 
yield m'(n) = m'(2k + i) = — i < 0, 1 < i < d, hence K(a)2fc+i = Ko(a)fc+i 

= K D(a)(2fc+i-(-i))/2 = K D(a)(n-m'(n))/2 toi 1 < i < d. 

For 2k +d < n < 2k+2d, now follows the Kc part (K(a)2k+d was a leading 
coefficient), and the positions n = 2k + d + i,l < i < d (as far as present) 
have m'(n) = m'(2k + d + i) = d — i > and ~K(a)2k+d+i = Kc(o)t + j = 
K c(a)(2fc+d+i-(d-i))/2 = Kc(o)( n _ m /(„))/2 for 1 < i < d. □ 

C. Kd and the Linear Complexity Profile 

Theorem 12 The distribution ofm a (t) on the seiF* of all prefixes of length 
t > can be calculated via K and Kd, by regarding F* as discrepancy space. 

(i) For even t > i/iere are (q — 1) • q* -1 sequences with m(t) = 0. 

(n) For > m(i) = t mod 2 wii/i |m(t)| < t there are (q — 1) • *- 1 + m (*) 
sequences with m(t) > —t and one sequence with m(t) = —t. 

(Hi) For < m(t) = t mod 2 with \m(t)\ < t iaere are (g — 1) • g> i ~ m (*) 
sequences with m(t) = t. 

Proof. The set F* of all sequence prefixes of length t yields also F* as set 
of all discrepancy prefixes, since K is an isometry. 

(i) Since m(t) = 0, both Kd and Kc have | symbols up to now, which 
(only) requires KD(a) t / 2 7^ to be a leading coefficient. The other t — 1 
symbols are arbitrary, hence we get (q — 1) • g t_1 possible sequences. 

(ii) With m(t) < 0, m(t) = m'(t). For m(t) ^ -i, K(a) i)i=t _ m(t )_ 1 ... t = 
K D(fl)i,i=(t+m(t))/2...((-m(t))/2 = IcO^^ , for some /c/ 0, thus (q-l)-l^ m ^ ■ 
g*— |m(*)|-i caseS; and for m ( t ) _ _ fj K(a) M =i...t = K D (a) Ii « = i.. i = 0* (1 
case). 

(Hi) Since m(t) > 0, the last Kd has encoded a polynomial of degree m(t) 
or higher and thus ended in m ^~ 1 a,a 7^ 0. The initial part of Kd and 
all of Kc are irrelevant for this m(t), and we have l m (*) _1 . (q — 1) . g i_m (*) 
possible sequences. □ 

Corollary 13 (see also Gustavson [T3] ) 
Let N(t,m(t)) be the number of sequences with a given length t and value 
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m(t). Then for all t G No and m(t) with —t < m(t) < t and m(t) = t mod 2 
N(t,m(t)) = 



1, m(t) = -t 

(?-!)• g* _ l m W-5l-l 3 otherwise □ 



The following theorem will finally be the foundation for Theorem 16 and the 
following section. We shall see that global statements about the behaviour 
of m, L and J are considered advantageously in the discrepancy space. For 
equidistributed a the resulting Kd(o) in the discrepancy space are also 
equidistributed by the following theorem (with respect to K(a) we have 
equidistribution anyway, since K is isometry). 
Theorem 14 

Vn G N ,V6 G F™ : | {a G Ff | K D (a) M=1 ... n = b}\=q n 

The q 2n sequences a are thus equidistributed concerning the Kd part of 
their partial denominators. 

Proof. Instead of a G F^ n we consider the sequence a' = K(a) G F;? n in 
the discrepancy space. Since K is an isometry, every a' G Fg n occurs exactly 
once for an a and the theorem is equivalent to 



Vn G N ,V6 G F£ : | {a' = K(a) M=1 ... 2n G Ff | K D (a) M= i...„ = 6} | = g n . 

We thus have to consider only the partition of K(a) into the parts Kd 
and Kg- First of all we have |Kd(o)| > |Kc(a)|, such that 2n symbols of 
K(a) include at least n symbols of Kd(o). 

Now let b = K D (a) be given. Then, by Theorem 11 the indices ti, . . . , t n 
with K(a)t i = Ko(a)i = 1 < i < n are fixed, and the other symbols 
K(a) s , s tj,Vi, can be chosen arbitrarily, that is in q n ways, without 
affecting Ki}(ct)i,i=i...n- This proves the theorem. □ 

Theorem 15 

(i) Ko(a) n / i/ and on/y i/ t/ie linear complexity profile assumes the 
value n. 

(ii) We can infer the whole linear complexity profile already from Kd alone 
(compare Wang j^Hl Th. 2.4]). 

Proof. 

(i) The linear complexity profile assumes the value n, if one of the sums 
If. := ^2 i= i \Ai\ is equal to n. In this case Kd starts with 

Q\M-ll ClQ \A2\-l lc2 ^ \A k \-l lck ^ 
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where lei = lc(Ai) G F ? \{0} are the leading coefficients, and thus K.r>(a) n = 
lc k ^ 0. 

On the other hand, all indices n with l k < n < l k+ i for some k lead to 
Kd(a)„ = 0, since this element is in the o' j4fc+1 ' _1 part. 

(ii) By (i) we know from Kd, which linear complexities l\ < I2 < ■ ■ ■ G N 
occur at all, and we put Iq = 0. 

At the end of each ir, m is zero, hence Vi £ No : m a (2 • Zj) = and 
L a (2 ■ li) = li. For those n S N with n ^ li for all i £ N„ there is a 

with 2 • Z fc < n < 2 ■ l k+l and then L a {n) = { \ k ' k < l k + h+i 

L lk+l, k > l k + l k+ i 

K ( a )i k +h+i = Jc(Afc+i)). □ 
In the next theorem we shall obtain the number of occurences of values 

m(t) by counting strings in Kd- 
Theorem 16 

(i) A linear complexity deviation m a = k < occurs wherever Kd contains 
a string a0' fc ',a 7^ 0, or i/a begins with 0' fc L 

(ii) ^4 linear complexity deviation m a = k > occurs wherever Kd con- 
tains a string k ~ 1 a, 

Proof. 

(i) The case a = 0' fc ' ... is obvious. When Kd(o) contains an a0' fc ', then 
a terminates a ttd and o' fc ' leads to a complexity deviation m = k < 0. 

On the other hand, by Theorem 9 this m a must result from a 7Td, which 
begins with o' fc ', thus either Kd = 0' fc ' (at the beginning of a) or Kd contains 
the string a0' fc L 

(ii) A complexity deviation m a = k > occurs exactly after a jump to 
k or higher, hence in a coding 7r(ylj) = d ~ 1 CdCd-i ■ ■ - Cq with d > k, at □ 

-D. Iterated Application of K 

Pseudorandom sequences should avoid easily guessable patterns, for in- 
stance neither a, nor K(a) should terminate in 0°°. Since K is an isometry, 
we can apply K again on K(a) to obtain K 2 (a) which ends in 0°° exactly 
for quadratic-algebraic a (with rational K(a)) and also is far from random. 
In fact, for every exponent k, the sequence K fc (a) should be well-behaved, 
that is look random. 

Hence, defining K°° as (K°°)(a) fc := (K k )(a) k , also K°° should behave 
well. 

Conjecture Whenever K* (a) is rational for some i, K°°(a) is algebraic. 
Then by a result of Christol et al. (I], K°°(a) has finite tree complexity 
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|29| . which can serve as a means to determine this a as nonrandom. 

E. Jump Complexity 

We represent the jump complexity J a (t), as introduced by Carter and 
Wang |35| via K by the number of nonzero symbols in Ko(a). The jump 
heights then will be lengths of zero runs in Ko(a). This prepares the global 
results on J, m and L in the next section. The Jump Complexity 

J a (n) ■= | {k | 1 < k < n A L a (k - 1) < L a (k)} |, n G N 

counts the number of jumps in the linear complexity profile of the sequence 
(ai, .. . ,a n ) (see Carter 
Theorem 17 Jump complexity and the discrepancy space 

(i) The distribution of jump heights corresponds to the distribution of zero 
runs in Kd : every zero run of length k — 1, k > 1 corresponds to a jump 
by k. 

(ii) The average distribution of jump heights on F?° can be modelled by the 
distribution of zero runs in Kd- Hence a jump by k occurs with probability 
p(k) = (q-l)/q k . 

(Hi) J (2-n) = \{i | K D (o)j ^ 0, 1 < i < n}\+5, with 5 = 1 form a (2n) > 
and 5 = otherwise, 
(iv) We set 



(i) Every jump by d corresponds to a -kd of 1 a,a G F g \{0} that is a 
sequence (5Q d ~ l a, a, (3 G F g \{0} in Kd (the first ttd is not preceeded by 



(ii) This follows as a global statement from part (i), see also Selmer 
VI,4] (Golomb's Theorem). 

(Hi) Every jump produces exactly one nonzero symbol in Kd- In the case 
m a (2n) > 0, the nonzero corresponding to the last jump is KD(a) n + m (2n)/2 
after KD(a)j 1 j=i... n and hence must be accounted for by adding 5=1. 

(iv) The statement follows from (Hi) and Theorem 14 (see also 5.3]).D 

Theorem 18 (Carter [H] for F2, Niederreiter for ¥ q , V q) 
The expectation for the jump complexity is 




Then\{a G F^ | J' a (2n) 



ill = l{^ ^ I & contains exactly j nonzeroes}\ 



Proof. 



a /?). 
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(i) for even t 

-ftt t q-1 1 1 

J(t) = o • — + 



2 q q+1 (g + l)-g*' 
(m) and /or odd i 



i g - 1 q 2 + 1 



2 q 2g-(g + l) (g + l)-g r 
Proof, (i) The first i bits of K(a) contain at least | bits from Kd(o), of 



2 

whose on average | • are nonzero and thus mark a jump. Furthermore, 
if m a (i) > there was another jump in Kd(o) after |. Since t is even, we 
obtain with Corollary 13 that 

m=2,4,6...t m'=0 

of all g* sequences in F* will deliver one more jump, thus in total 



i g - 1 1 g { - 1 i g - 1 1 1 



imilar to (i) we get 
positive odd m a (t). With 



2 g g* g + 1 2 g g + 1 (g + 1) ■ g* 

±1 q-± 
2 " g 

(t-3)/2 



(m) Similar to (z) we get ^jp • ^-i jumps on average and one more jump for 



V " (n 2 \(t-l)/2 1 _t-l 1 

£ (?-l)-^ m = (9-1)- £ ^) m ' = (g-l)- g2 ~ = ^T-T 1 

m=3,5,7...t m'=0 9 9 "r 



we thus obtain 



t + 1 g - 1 1 g*- 1 - 1 t g - 1 g - 1 1 1 



jYf) = . 1 i i :+_.^ i = 1.1 i+i 1j : : □ 

W 2 g g* g + 1 2 g 2g g(g + 1) (g + l)g* 



F. Recurrence Times, m-Pattern Frequencies 
For k,l£Z let 

oo , 

A(k,l) = E T ' P ( m (* + r ) = ' I = fcAm(t+l), . . . ,m(t + r- 1) / / 

T = l ^ 
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denote the average recurrence time to go from m(t) = k to m(t + r) 
the Translation Theorem 9 the probabilities are independent of t). 
Theorem 19 

(i) A(/s, As) = ^-2-gMl+l, 

(m) A(fc,Z) = k — l, 

(Hi) A(l,k) = A(k,k) - A(k,l), 

(iv) A(-k,0) = k + 2 ■ q/(q - 1), 

(u) A(-ife,Z) = A(-k,0)-l, 

(vi) A(k,-k) = 2-q-(q k -\)/(q- 

(vii) A(k,—l) = A(l,-l) + k-l, 

(viii) A(-k,-l) = A(-l, -I) + k - I, 

(ix) A(-k,l) = A(0,0-A(0,-fc), 

(x) p(m(t) = k) = A(k,k)' 1 



/(by 



1) 



k > I > 
fc> Z > 

< I < k 

ken 
k,leN 
0<l<k 
< k < 1/ 



Proo/. 

(i) To obtain an m(t) = k > 0, a polynomial of degree d > k must 

occur, hence Kd must include a pattern k ~ 1 a,a / (see 16(m)). This 

l k 
pattern has probability , its recurrence time in Kd is thus . Since an 

equally large part in Kc must be passed, we have A(fe, k) = 2q k /(q — 1) for 
k G N. Negative m(t) = —k < similarly require the pattern a0 k , whence 
A(-fc, -fc) = 2q k+1 /{q - 1) for jb G N . 

(ii) From m(t) = /c > the first — I symbols lead to m(t + k — I) = I > 0. 
(m) Every path from k to k passes I, hence A(k, k) = A(k, I) + A(Z, k). 

(iv) We jump after n steps with probability and then reach m = 
after a total of fe + 2n steps. Hence A(-fe,0) = Y^n=i ^ ' (k + 2n) = k + -^. 

(v) is a consequence of (iv) and (ii), since A(— A;,0) = A(—k,l) + A(Z,0) 
for 1 < Z < k. 

(vi) We have A(k,-k) = k + A(0,-fc) and A(-ife,-fc) = A(-fc,0) + 
A(0, -k), hence A(fc, —A;) = k+A(-k, -k)-A(-k, 0) = fc+^^-(A;+^ T ). 

(vii) For k > I we have to add — Z steps before A(Z, — I), for k < I the 
first Z — steps have to be omitted. 

(viii) Since | — k\ > | — Z|, the path leads through zero, hence A(—k,—l) = 
A(—k, 0) + A(0, — /). The result now follows from (iv) and (vii). 

(ix) To get from m = to m = I G Z, we have to pass the values m = 
-l,-2,...,-|Z| + 1, hence A(0,Z) = A(0, -k) + A(-k, I) fur 1 < k < |Z|. 

(x) This follows from 



p(m(t) = k) 



lim — ■ \{t | m(t) 



k, 1 < k < n}\ = A(k,ky 



□ 
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Corollary 20 In the binary case the formulae can be considerably simpli- 
fied. Remarkably, over ¥2 the value A(k,l) is integral for all k, I E Z. 



(<) 


A 2 (k,k) 


= 2l fc -^l+f 




fcez 


(ii) 


A 2 (M) 


= k — I, 




k > I > 


(in) 


A 2 (l,k) 


= 2 k+1 + k- 


-l, 


k> I > 


(iv) 


A 2 (-k,0) 


= k + A, 






(v) 


A 2 (-k,l) 


= k-l + 4, 




< I < k 


(vi) 


A 2 (k,-k) 


_ 2 k + 2 — 4 




ken 


(vii) 


A 2 (k,-l) 


= 2 l+2 + k- 


-I -A, 


k,leN 


viii) 


A 2 (-k,-l) 


= 2 l+2 + k- 


-I, 


<l <k 


(ix) 


A 2 (-k,l) 


= A 2 (0,0- 


A 2 (0,-k), 


0<k<\ 



Niederreiter defines in [251 Sect. 5] formulae for the average frequency of 
L(t) = A L(t + 1) = t -^§ L in the sequence L(i)f =1 for n — >• 00. Using 
m-notation this corresponds to m(t) = cq and m(t + 1) = c% — 1. We give a 
formula for arbitrary patterns m(t), m(f + 1), . . . , m(t + k — 1). 

Theorem 21 Let an m-pattern (wiq, mi, . . . , m&) 6e given with m,j + i £ 
— 1, 1 — anc? m.j > =^ mj + i = rrtj — 1. Let 
#(!) := IWil^i < A m,; + i = mj — 1, < i < k — 1}| and 
#(T) := |{mi|mi < A m m = 1 - m*, < i < k - 

The probability of occurrence for the pattern (mo, mi, . . . , rut), that is 

lim — ■ \{t I 1 < t < N,m(t + j) = m,-,0 < j < fe}|, 

is 

n(m m ml- 1 ^ = 

p(m , mi , . . . , m fe ) - ^^^y • g#a)+# ( T) 

Proof. The conditions make the pattern feasible (all other patterns have 
probability zero). The pattern occurs, whenever mo occurs with p(mo) = 
A(mo,mo) _1 , and furthermore, if for nii < the linear complexity devia- 
tion jumps #(|) times (with probability p = ^-) and is decremented #(|) 
times (with probability p = |). □ 



V. Levy Classes and Sharp Asymptotic Bounds for J and m 

As we have seen, the linear complexity deviation and the jump complex- 
ity of a sequence a can be described immediately in terms of Kj)(a). Pur- 
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thermore, for each length t and any given sequence b G F* the probability 
p(Ki)(o)i i i = i.„( = b) = hence equidistributed. We may thus convert 
theorems about the behaviour of Bernoulli sequences, like the Law of the 
Iterated Logarithm, directly into corollaries about the asymptotic behaviour 
of m a (t) and J a (t). 

The theorems used here are compiled in Revesz [3^. Since Revesz only 
treats the case F2, we also refrain from utmost generality and consider only 
binary sequences, anyway the most important case from a practical point 
of view. An exception is the Law of the Iterated Logarithm for the jump 
complexity which we show for F^°, q an arbitrary prime power. 

A. Levy Classes 

We shall use repeatedly the notation (V^ a G F?° . . .) or (fi-almost all 
a G F^° . . .) to imply that the statement ... is valid on a subset A C F^° of 
measure fi°°(A) = 1, hence false at most on a set of measure zero. 

The functions m a , L a J a and similar are defined on N (we ignore the value 
at zero). When we vary a on F^°, we obtain functions on F^° x N. Let us 
first examine the partition into an a-invariant part and the oscillation that 
depends on a. Given a function / : F2° x N — » R, we define its Levy classes, 
four classes of functions (i.e. classes of real-valued sequences) that describe 
the asymptotic behaviour of / (upper and lower class, resp.): 

(*) UUC(f) ={«Gl N | a, 3t G N, Vt > t : f(a, t) < a(t)} 
(ii) ULC(f) = {a G M N I V M a, Vt E N, 3t > t : f (a, t) > a(t)} 
(in) LUC(f) = {a G M N | a, Vt G N, 3t > t : /(a, t) < a(t)} 
(iv) LLC(f) = {a G M N | a, 3t G N, Vt > t : f(a, t) > a(t)} 
Thus for all choices «i G LLC(f), a 2 G LUC(f), a 3 G ULC(f), a 4 G 
UUC(f) and for almost all sequences a G A°°, we have a.\ < f(a) < 
asymptotically, but ^-almost all sequences will make / oscillate so much as 
to repeatedly leave the interval (02,03) of unavoidable oscillation. 

In the sequel we will use the following typical examples for functions /: 
maximum length of runs of zeroes (that is jump height, degree of partial 
denominators, deviation m) in Kd, as well as deviations \J a (t) — \ • ^r\- 
For the case q = 2 there exist very precise estimates for the Levy classes. 

The model used by Revesz is a discrete Brownian motion on Z: Let Xi G 
{ — !,+!} be random variables with p(X{ = +1) = p(X{ = —1) = |. We 
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start at time t = at zero. Given a sequence b G FJf, let 

n n 

Sb{n) = J^(2 • b t - 1) = -n + 2 • n € N 

t=i t=i 

be the deviation from zero after n moves, where bt = corresponds to 
X t = -1, and 6* = 1 to X t = +1. 



5. Jump Complexity 

The jump complexity counts the number of partial denominators in the 
encoding K(o) = 7t(^4i)|7t(j4.2)| . • ., which is equivalent to the number of 
nonzeroes in K£>(a) = 7T£ ) (A 1 )|7r£)( J 4 2 )| . . . = dl_1 /ci|0 d2 ~ 1 /c2| . . . (using 
twice as many symbols in K as in Kd). Hence we have the following model 
for J in terms of S: 

Theorem 22 For every sequence a G F£°, b = K(a) ; and every length 
2 ■ t, t G Nq we have: 



S b(t)+t , x x _f 0, m a {2-t) < 

m a (2 ■ t) > 



Proof. In Theorem 11 {in) we saw J a (2 • i) = X^i=i Kofflji + 5 (we may 
just sum up instead of counting nonzeroes since we work over F2). Now, 
identifying Kd(o) with (bk), the theorem follows from Sb ^ +t = Y%=i bk- ^ 
Theorem 23 Law of the Iterated Logarithm for tossing a fair coin 

f(t)eUUC(S a (t)/Vt) <=► S - =1 /M. e -^ <00 



f(t) G ULC(S a (t)/y/i) ^ E - =1 ZM. e --V 
/(t) G LUC(S a (t)/V~t) <=> -f(t) G ULC(S a (t)/Vt) 
f(t) G LLC(S a (t)/Vt) -/(t) G UUC(S a (t)/Vi) 

Proof. The proof goes back to Erdos [Sj, Feller ^2], an d Kolmogoroff 
The theorem is [5.2] of Revesz (2^. □ 
Some example functions bounding 5 a (t) / y/Jt) show that we can not avoid 
oscillations on the order of the "iterated logarithm" log log (t). 
Example For all e > we have: 

(2-loglog(t) + (3 + e)-logloglogt) 1 /2 g UUC(S a (t)/\/i) 

(2-loglog(t) + log log log t)V2 g U LC{S a {t) / \fi) 

-(2-loglog(t) + log log log t) 1 / 2 G LUC(S a (t)/Vt) 

-(2-loglog(tj + (3 + e)-logloglogt) 1 /2 G LLC(S a (t)/^t) 
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From Theorem 23 we now infer a Law of the Iterated Logarithm for the 
Jump Complexity in the binary case: 

Theorem 24 For /i-almost all a G with b = K(a) the jump complexity 
J a (2t) observes: 



f(t) G UUC(S b {t)/Vt) 
f(t) G ULC(S b (t)/Vt) 
f(t) G LUC(S b (t)/Vt) 
f(t) G LLC(S b (t)/Vt) 



(Vt • /(i) + t)/2 + 1 
(Vt-/(t) + t)/2 
(Vt-/(t) + t)/2 + l 
(Vt-/(*) + t)/2 



G UUC(J a (2t)) 
G ULC{J a (2t)) 
G LUC{J a {2t)) 
G LLC{J a {2t)) 



In particular, the classes contain the following functions: 

t/4 + 1 + Vt • ^/lo g joggj/j + (3/8 + e) • log log lo g t 

t/4 + V* • Vlog joggj/j + 1/8 • log log log t 

t/A + 1 - y/i ■ ^log log(t)/4 + 1/8 • log log log t 

t/4 - y/t ■ log(t)/4 + (3/8 + e) • log log log t 



G UUC(J a (t)) 
G ULC(J a (t)) 
G LUC(J a (t)) 
G LLC(J a (t)) 



Proof. By Theorem 22, we can replace J a by 



S b (t)+t 



+ 5 and thus obtain 



with Theorem 23 and the above example, resp. the statements. See also the 
next theorem for arbitrary finite fields ¥ q . □ 
Theorem 25 The Law of the Iterated Logarithm for the Jump Complexity 
over arbitrary finite fields ¥ q 



(0 

(*) 



lim r , 
liuu 



(Ja(n) 



H g" 1 ~| / 
2 o / / 



-n • log log n = +1 



n 
2 



a.e. 



a.e. 



Proof. Theorem 22 can be immediately generalized to all prime powers 
q. Thus we can apply the Law of the Iterated Logarithm for Bernoulli se- 
quences with p = - as proportion of zeroes (we do not descriminate between 



nonzero symbols) (see Feller to obtain (i) and (ii) 



□ 



C. Linear Complexity Deviation m 

We now consider the linear complexity deviation m. m takes on its max- 
imum values at a jump (except for rational a, but we can ignore the set 
K -1 (§ n F g [x]) of measure zero). Hence the length of zero runs in Kd(o) is 
of importance, since every 7Td consists of a zero run (of length d — 1 G No) 
and the leading coefficient that makes m jump. We first define the length 
of the largest uninterrupted sequence of —1 in {Xi)i = \ n or of zeroes in 
b\, . . . , b n , resp. as 



Z b (n) :-- 



max 

0<r<n 



max 

0<k<n—r 



S b (k)-S b (k + rj)} 
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Theorem 26 Let a G with b = Ko(a) and m a (2n) = 0, i/ien : 
max |m a (i)l = + 1 

t<2n 

Proof. The largest value |m a (i)| occurs after the longest zero run in 
Kd, which is of length maxj< 2n |m a (i)| — 1 and terminates with a leading 
coefficient, since m a (2n) = 0. Zj, just gives that longest run length, and at 
t = 2n, Kd contains n symbols. □ 

Theorem 27 Levy classes for Z{n) 

oo 

^2~ /(n) < oo <=> f(n) G UUC(Z(n)) 

n=l 

oo 

J2^ f(n) = oo <=> f{n) G ULC{Z{n)) 
n=l 

f(n) = Llog 2 (n)-log 2 log 2 log 2 (n)+log 2 log 2 (e)-l+eJ G LUC{Z{n)),Ve > 
f{n) = Llog 2 (n)-log 2 log 2 log 2 (n)+log 2 log 2 (e)-2-eJ G LLC(Z(n)),Ve > 

Proof. See Erdds and Revesz ^01, Revesz (22] ■ □ 
Theorem 28 £ei>y classes for m + {n) := maxj< n 

oo 

Y^2- f(n) < oo <=> /(n) E CftfC(m + (n)) 

n=l 

oo 

J^2- / W = oo ^ /(n) G ULC{m + {n)) 

n=l 

ft ft 

f(n) = l+ Llog 2 (-) - log 2 log 2 log 2 (-) + log 2 log 2 (e) + ej 

G LUC(m + (n)), Ve > 

TL Tl 

f(n) = 1 + |tog 2 ( j ) _ lo g2 !og 2 log 2 ( ^ ) + lo §2 log 2 (e) - 2 - ej 

G LLC(m+(n)), Ve > 

Proof. The theorem follows from Theorems 26 and 27, where the conver- 
gence of the sums does not depend on the constant +1 or the fact that the 
function /(n/2) is sampled twice as often (both supply only a factor 2 ±1 ). 
The statements about UUC and U LC already appear as Th. 8,9]. □ 
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Theorem 28 allows us to show that a fixed bound on m generally is too 
restrictive, but logarithmic growth is feasible for m. We therefore define the 
notions of perfect and good profiles. 

A sequence a G is called d-perfect for a d G N if Vf G N : |m a (t)| < d. 
A sequence a has a good linear complexity profile, if 

3 C G M,Vre G N : \m a {n)\ < 1 + C ■ log re. 

Corollary 29 

(i) There are [i-almost no sequences with d-perfect linear complexity profile. 

(ii) [i-almost all sequences have a good linear complexity profile 
Proof. 

(i) For all d G N, f(n) = d becomes smaller than every sequence in LLC, 
which itself is superated by /i-almost all sequences from some uq on. 

(ii) For /(re) := 2 • log 2 (n) the sum Ylri=i 2~^ n "> converges, and hence for 
/i-almost all sequences a we have m a (t) < f(t) G UUC for all t > to(a) for 
some t (a). □ 

Theorem 30 |241 Th. 2] Let f,g be functions on N with lim^^oo /(re) = 
lim^^oo g(n) = oo. Then: 

V M oGF^ : -f(n)<m a (n)<g(n) 

Proof. See Niederreiter [25 Th. 2]. □ 
Remark Theorems 27 and 30 together show: The largest linear complex- 
ity deviation ever occurring is of the order log 2 n for almost all sequences — 
however, for almost all sequences from a fixed reo onwards the linear com- 
plexity deviation is only of the order /(re) := log 2 ^ re for arbitrarily large k. 
Theorem 31 

(i) For the lengths Zi(n), Z%(ri), . . . of the second, third ... largest run 
of zeroes Deheuvels jS] has found the following functions (where logg := 
log Jlog^ ) with log 2 (x) = for x < 1, log 2 (x) = log 2 (x) for x > 1. 

For all k G N, r > 2, e > we /iaue: 

/(re) = log 2 (re) + i-(log( 2) (re) + ...+log^ 1) (re) + (l+e)logf ) (n)) 
G UUC(Z k (n)) 

/(re) = log 2 (re) + ~ • (log( 2) (re) + . . . + log^V) + logf } (re)) G ^C(Z fc (n)) 

/(re) = |k>g 2 ( n ) - lo S 2 lo S 2 log 2 («) + !og 2 log 2 (e) + ej G LC7 C(Z k (n)) 
/(re) = |k>g2(™) - !og 2 !og 2 log 2 (re) + log 2 log 2 (e) - 2 - ej G LLC(Z k (n)) 
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(ii) Hence for the second etc. largest degree dk(t) in the continued fraction 
expansion, we obtain: 

m = i+io^^+^-aogf^+.-.+iog^^+a+^iogW^)) 

G UUC(d k (t)) 

f(t) = 1 + log/-) + I • (log< 2) (^) + . . . + log^ 1 ^) + logW(^)) G ULC(d k (t)) 
f{t) = 1 + Llog 2 (^) - log 2 log 2 log 2 (^) + log 2 log 2 (e) + e\ G LUC{d k (t)) 
f(t) = 1 + Uog 2 (^) - log 2 log 2 log 2 (^) + log 2 log 2 (e) — 2 — e\ G LLC(d k (t)) 

Proof (i) See Deheuvels [5] and Revesz [3T1 S. 61]. Observe that LUC, LLC 
do not depend on k. 

(ii) follows from (i) with Theorem 26, compare the proof to Theorem 28. □ 

VI. 2-adic Span and Complexity 

Klapper and Goresky ^1] ^S] introduced another measure to assess the 
(non-)randomness of a bit string, the representation of (oi, a 2 , «3, . . .) as a 
2-adic integer a = J^Si a i2*~ 1 G Z 2 and its approximations by rational 
numbers from Q, as given in Mahler [IE] and De Weger 0. 

Klapper and Goresky view a sequence (ai,a 2 , . . .) G F2 as a more and 
more precise description of a 2-adic integer in Z 2 (the base field now must be 
prime, not just a prime power, and we shall treat only p = 2). For every finite 
prefix (01, ... , a k ), we obtain the number := Yli=i a «2 J_1 G Z C Z 2 . We 
are now interested in describing the number by a fraction |^ with the 

condition % • a( fc ) = mod 2 k . 
We need some definitions: 

For k G N,oi,...,o A G F 2 , let = Ya=i a-i^' 1 G Z. We define the 
lattice (or Z-module) C a (k) = {(p k ,q k ) £ Z 2 | q k ■ = p k mod 2 k } and 
set £'„(£;) = £ a (AO\{(0,0)}. 

For (p,q) G Z 2 , let $(p, q) = max(\p\,\q\). 

Let (c k ,d k ) be a minimal pair from C a (k) in the sense of Q((c k ,d k )) < 
$((P*,9*)) for all (p k ,q k ) G 
We call the sequence (c k ,d k ) the minimal approximating sequence of a. 
The 2-adic complexity now is defined as ^2(0, k) := log 2 (<£(cfc, d k )) G M 
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in [Tl|. However, we are interested here in an isometric model of 2-adic ap- 
proximation. Thus we define A: ¥?? — ► Fif 3 as 



A(oi,a 2 , • • -)k 



0, if (c k ,d k ) = (fik^dk-i) 

1, if (ck-.dk) ± (ck-i,d k -i) 



for k G N, where we set (co,cfo) := (0, 1)- 
Theorem 32 A zs an isometry on FSf. 

Proof. Let a,6 G with a>i = bi,l < i < k and = 1 — 6/%. Then 
A(a)j = A(6)j for 1 < i < k, since both sequences lead to the same sequence 
(ci,di) of rational approximations. 

However, from dk~i ■ Yli=i a i 2% _1 = c fc-i m od 2 fc_1 , we may infer 
d k -i ■ Yli=i H 2 '^ 1 + & ■ 2 fc_1 = Cfc-l mod 2 fc , for a ,5 G {0, 1} and thus 
4-1 • Et"i « 4 2 i ~ 1 + (1 - 5) ■ £ Ck-i mod 2 k . 

Thus A(ai, . . . ,a k -i,6, . . .) k = and A(oi, . . . ,a k -i, 1 - S, . . .) k = 1. 

One of these corresponds to A(a)k, the other to A(b) k , hence |A(a) — 
A(6)| = —k = \a — b\ and A is an isometry. □ 

We further define the 2-adic jump complexity J a to count the number 
of changes in the 2-adic complexity profil: J^(a)(n) := Yli=i A(a)«, which 
should behave like Ja{ u ) ~ In analogy to m, we thus define the 2-adic 
jump complexity deviation tua (n) := 2 ■ Ja{ u ) — n G Z. 

Theorem 33 mA(t) and Sb(t) have the same average and asymptotic be- 
haviour, precisely 

f G LLC . . . UUC{m A {t)) < ^=> f G LLC . . . UUC{S h {t)) 



Proof. Ja(*) and the sum Ya=i of the coin tossing sequence b t both 
represent the same behaviour, summing up a p = \ Bernoulli experiment. 
With m^fn) = 2-Ja(u) — n and Sb (n) = 2- ^™ =1 6j — n the result follows. □ 

We have thus 

Corollary 34 Law of the Iterated Logarithm for the 2-adic Jump Complex- 
ity Deviation 

(i) lim n _ ) . 00 m^(n) / (yJ2n ■ log logn) = +1 /i — a.e. 
(ii) lim,,^ rriA{n) / (y/2n • log logn) = —1 u — a.e. 

Remark Whereas the original 2-adic complexity <f> is not usually integral, 
J A and mA put us again in the world of coin tossing. However, in order to 
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compute A, until now we have to iteratedly calculate $ and derive A from 
the sequence of convergents. 
Open Problem 

Can we calculate A(a)j j=i... n with bit complexity 0{n 2 ) ? 
Can we even calculate the shifted 2-adic profiles A(oi . . . a n ), A (02 ■ . . a n ) 
. . . A(a n _i . . . a n ) together in 0{n 2 ) time ? 

VII. Conclusion 

We have shown that both the expansion of formal power series into their 
continued fraction expansion and the approximation of 2-adic numbers by 
rationals induce an isometry on F^°, K and A, resp. 

We modelled linear and jump complexity as well as 2-adic jump complexity 
via Bernoulli experiments, and applied the known general bounds to this 
model to derive sharp bounds (Levy classes) for J,m, and J a- 

We gave an adaptation of the Berlekamp-Massey-algorithm that imple- 
ments the continued fraction expansion exactly as for the reals to obtain K. 
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